
Search in the Catalogues and Directories

Hits 21 – 40 of 168

21
Bayesian phylogenetics, sequence alignment and the genetic structure of the Kainji languages. ...
Bacon, Geoff; Bird, Steven. - : Monash University, 2016
BASE
22
Learning Crosslingual Word Embeddings without Bilingual Corpora ...
BASE
23
Language Preservation 2.0: Crowdsourcing oral language documentation using mobile devices
Bird, Steven. - 2015
BASE
24
Documentary Linguistics and Computational Linguistics: A response to Brooks
Bird, Steven; Chiang, David; Frowein, Friedel. - : University of Hawaii Press, 2015
BASE
25
Practical Natural Language Processing for Low-Resource Languages.
Abstract: As the Internet and World Wide Web have continued to gain widespread adoption, the linguistic diversity they represent has also been growing. At the same time, the field of Linguistics is facing a crisis of the opposite sort: languages are becoming extinct faster than ever before, and linguists now estimate that the world could lose more than half of its linguistic diversity by the year 2100. This is a special time for Computational Linguistics; the field has unprecedented access to a great number of low-resource languages, readily available to be studied, but needs to act quickly before political, social, and economic pressures cause these languages to disappear from the Web. Most work in Computational Linguistics and Natural Language Processing (NLP) focuses on English or other languages that have text corpora of hundreds of millions of words. In this work, we present methods for automatically building NLP tools for low-resource languages with minimal need for human annotation in these languages. We start with language identification, focusing specifically on word-level language identification, an understudied variant that is necessary for processing Web text, and develop highly accurate machine learning methods for this problem. From there we move on to the problems of part-of-speech tagging and dependency parsing. For both of these problems we extend the current state of the art in projected learning to make use of multiple high-resource source languages instead of just a single language. In both tasks, we are able to improve on the best current methods. All of these tools are practically realized in the "Minority Language Server," an online tool that brings these techniques together with low-resource language text on the Web. Starting with only a few words in a language, the Minority Language Server can automatically collect text in that language, identify its language, and tag its parts of speech. We hope that this system provides a convincing proof of concept for the automatic collection and processing of low-resource language text from the Web, and one that can be realized before it is too late.
Keyword: Natural Language Processing
URL: http://hdl.handle.net/2027.42/113373
BASE
26
Documentary Linguistics and Computational Linguistics: A response to Brooks
Bird, Steven; Chiang, David; Frowein, Friedel. - : University of Hawaii Press, 2015
BASE
27
Language Preservation 2.0: Crowdsourcing oral language documentation using mobile devices
Bird, Steven. - 2015
BASE
28
Computational support for early elicitation and classification of tone
Bird, Steven; Lee, Haejoong. - : University of Hawai'i Press, 2014
BASE
29
Computational support for early elicitation and classification of tone
Bird, Steven; Lee, Haejoong. - : University of Hawai'i Press, 2014
BASE
30
Collecting bilingual audio in remote indigenous villages
BASE
31
The International Workshop on Language Preservation: An Experiment in Text Collection and Language Technology
Bird, Steven; Chiang, David; Frowein, Friedel. - : University of Hawaii Press, 2013
BASE
32
The International Workshop on Language Preservation: An Experiment in Text Collection and Language Technology
Bird, Steven; Chiang, David; Frowein, Friedel. - : University of Hawaii Press, 2013
BASE
33
Effects of distributed practice on the acquisition of second language English syntax
In: Applied psycholinguistics. - Cambridge [u.a.] : Cambridge Univ. Press 32 (2011) 2, 435-452
BLLDB
OLC Linguistik
34
Tone in Usarufa: Field Recordings
Bird, Steven. - 2011
BASE
35
Equipping university students to document their ancestral languages
BASE
36
Book Review
In: Natural language engineering. - Cambridge : Cambridge University Press 17 (2010) 3, 419-424
OLC Linguistik
37
Effects of distributed practice on the acquisition of second language English syntax
In: Applied psycholinguistics. - Cambridge [u.a.] : Cambridge Univ. Press 31 (2010) 4, 635-650
BLLDB
OLC Linguistik
38
The human language project: building a universal corpus of the world's languages
In: Association for Computational Linguistics. Proceedings of the conference. - Stroudsburg, Penn. : ACL 48 (2010) 1, 88-97
BLLDB
39
The Big Australian Speech Corpus (The Big ASC)
Chetty, Girija; Cassidy, Stephen; Butcher, Andrew Richard. - : Causal Productions, 2010
BASE
40
Fast query for large treebanks
Ghodke, Sumukh; Bird, Steven. - : Association for Computational Linguistics, 2010
BASE
